Creating multimedia content by synchronizing animation drawings to audio and textual data

ABSTRACT

A method and system for creating multimedia presentation prototypes use linear compression or stretching of the playback speed of a record of creating and editing graphic images, which does not degrade perceived quality. When manually created drafts, schemes and drawings are used as the graphic images of the presentation, playback of the record of their creation and editing can be sped up or slowed down over a broad time range without losing the perceived quality of the visual content. The audio and video tracks are synchronized by linearly compressing or stretching the playback speed of the record of creating and editing the graphic images until the playback duration of the frame's visual content matches the duration of its sound.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a US National Phase of PCT/RU2013/000960, filed on Oct. 28, 2013.

BACKGROUND OF THE INVENTION

Field of the Invention

This invention relates to a method and system for creating multimedia presentation prototypes. The invention also relates to the information technology field and is directed to creating audiovisual multimedia presentations consisting of sequences of scenes (frames).

Description of the Related Art

Creating multimedia presentations takes up time and resources. Methods and devices for creating multimedia presentations are described in U.S. Pat. Nos. 8,078,967 and 7,941,757, disclosing the definition of a sequence of scenes (frames), of frame text and visual content, and of an order of transition between the scenes. These patents also disclose combining and composing text, visual imagery and an audio track into one audio-visual file. U.S. Pat. No. 7,546,544 also discloses creation of multimedia presentations. All of the conventional solutions have the same shortcoming—a high manpower effort and a complexity of use.

Accordingly, a method and system for efficient creation of multimedia presentation prototypes are desired.

SUMMARY OF THE INVENTION

Accordingly, the present invention is related to a system and method for creating multimedia prototype presentations that obviates one or more of the disadvantages of the related art.

A method for creating multimedia prototype presentations uses linear compression or stretching of the playback speed of a record of creating and editing graphic images, which does not lead to degradation of perceived quality. When manually created drafts, schemes and drawings are used as the graphic images of the presentation, playback of the record of their creation and editing can be sped up or slowed down over a broad time range without losing the perceived quality of the visual content. The audio and video tracks are synchronized by linearly compressing or stretching the playback speed of the record of creating and editing the graphic images until the playback duration of the frame's visual content matches the duration of its sound.

Additional features and advantages of the invention will be set forth in the description that follows, and in part will be apparent from the description, or may be learned by practice of the invention. The advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.

BRIEF DESCRIPTION OF THE ATTACHED FIGURES

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.

In the drawings:

FIG. 1 illustrates an implementation of a device for creating multimedia presentation prototypes;

FIG. 2 illustrates a screenshot of the system in different modes;

FIG. 3 illustrates a screenshot of the system in a “Presentations management” mode;

FIG. 4 illustrates a screenshot of the system in a “Text editing” mode;

FIG. 5 illustrates a screenshot of the system in a “Sound editing” mode;

FIG. 6 illustrates a screenshot of the system in a “Graphic editing” mode;

FIG. 7 illustrates an example of a computer or a server on which the invention may be implemented.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.

The present invention is directed to a method for creating multimedia presentation prototypes. A user creates a prototype (a light version) of a multimedia presentation for personal or professional purposes: for example, to show a viewer of the presentation directions to an unfamiliar place, to create a simple garden furniture assembly instruction, or to get proposals on developing a company's business processes across to work colleagues.

A multimedia presentation includes a video track in the form of a sequence of graphic images (screens, frames, scenes) and an audio track with a voice dubbing of the presentation text that accompanies the video sequence. The base for creating an audio track is the text of the presentation prepared in advance, as well as a transcription and editing of unprocessed and unprepared speech.

The text of the presentation, and the multimedia presentation itself, consist of a sequence of elementary, indivisible substantive fragments named “frames,” by analogy with television program production. Each frame has its own text (a fragment of the general presentation text), its own audio track (a fragment of the general presentation track) and its own graphic images in the form of a visualized process of creating and editing graphic images.

For correct perception of the presentation content, the record of the process of creating and editing the frame images displayed on the screen must be synchronized with the text of the current frame. In the exemplary embodiment, the base for multimedia presentation creation is an audio track with the record of the synchronized presentation text. Linear compression or stretching of the playback speed of the voice record leads to an essential degradation of the perceived quality of the synchronized text and of the presentation as a whole.

Instead, in the exemplary embodiment, linear compression or stretching of the playback speed of the record of creating and editing graphic images does not degrade perceived quality. When manually created drafts, schematics and drawings are used as the graphic images of the presentation, playback of the record of their creation and editing can be sped up or slowed down over a broad time range without losing the perceived quality of the visual content. The audio and video tracks are synchronized by linearly compressing or stretching the playback speed of the record of creating and editing the graphic images until the playback duration of the frame's visual content matches the duration of its sound.
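As an illustration of this synchronization step, the following is a minimal sketch of linear time scaling of a recorded drawing session, assuming the record is stored as (timestamp, event) pairs; this format is an illustrative assumption, not the patent's own data layout.

```python
def scale_drawing_record(events, audio_duration):
    """Linearly compress or stretch event timestamps so that playback of
    the drawing record lasts exactly as long as the frame's audio."""
    if not events:
        return []
    record_duration = events[-1][0]  # time of the last drawing event
    if record_duration == 0:
        return [(0.0, e) for _, e in events]
    factor = audio_duration / record_duration
    return [(t * factor, e) for t, e in events]

# Example: a 30-second drawing record replayed over 12 seconds of narration.
events = [(0.0, "pen_down"), (12.0, "line_to"), (30.0, "pen_up")]
print(scale_drawing_record(events, 12.0))
# [(0.0, 'pen_down'), (4.8, 'line_to'), (12.0, 'pen_up')]
```

Because the drawn strokes themselves are unchanged and only their replay pace varies, the scaling factor can range widely without the degradation that the same operation causes for speech.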

FIG. 1 illustrates an implementation of a device for creating multimedia presentation prototypes, where: 101—a device; 102—a screen with a sensor for detecting the position of a finger touch; 103—a stylus for screen interaction; 104—graphic images drawn and displayed on the screen; 105—a device microphone; 106—a device speaker. An exemplary embodiment is designed for use on a portable personal computing device 101 with a touch screen 102 and is geared towards a visually displayed and visually controlled process of creating and editing the presentation content. Operations with the touch screen that do not require high accuracy of contact point positioning can be made by fingers. Operations requiring enhanced accuracy of contact point positioning (e.g., drawing) can be made with the stylus 103.

FIG. 2 illustrates a screenshot of the device in different modes, where: 201—a device mode of operations selection panel; 202—a work field of the selected mode; 203—graphical or textual content in a work field; 204—a field-button of the mode “Presentations management”; 205—a field-button of the mode “Text editing”; 206—a field-button of the mode “Sound editing”; 207—a field-button of the mode “Graphic editing.”

FIG. 3 illustrates a screenshot of the device in the “Presentations management” mode, where: 301—icons (final drawings) of the presentation prototypes in a work field; 302—a minimized button-field of the current mode; 303—a button “Create presentation”; 304—a button “Open presentation for editing”; 305—a button “Launch demonstration of presentation”; 306—a button “Delete presentation”; 307—a button “Save presentation as”.

FIG. 4 illustrates a screenshot of the device in the “Text editing” mode, where: 401—a text of the prototype in a work field; 402—markers of frames' text edges; 403—a button “Import text”; 404—a button “Verbal dictation of text with recognition”; 405—a button “Enter text from on-screen keyboard”; 406—a button “Insert marker of frames text edges.” The markers of the text markup 402 are used for visual splitting of the presentation text, and consequently the presentation itself, into the frames.

FIG. 5 illustrates a screenshot of the device in the “Sound editing” mode, where: 501—a button “Import sound from file and markup sound into words”; 502—a button “Record sound from microphone and markup sound into words”; 503—a text of a part of the presentation for which an audio track is already marked up into the frames; 504—a text of a part of the presentation for which an audio track is not marked up into the frames yet; 505—process visualization and animation and marked up edges of an audio track.

FIG. 6 illustrates a screenshot of the system in the “Graphic editing” mode, where: 601—a button “Turn on/off figures automatic recognition mode”; 602—a button “Turn on/off text narrator”; 603—a button “Move forward/back through frames”; 604—graphic images being created; 605—graphic tools and palette.

According to an exemplary embodiment, the multimedia prototypes are created according to the following scenarios:

-   “Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”;
-   “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames”;
-   “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”.

The method of creating multimedia presentation prototypes using the scenario “Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics” includes determination of the order of frames, their textual and visual content and the order of transition between the frames, and the preparation and mutual composition of text, visual content and audio track of the multimedia presentation into an audio-video file by using a personal computing device (e.g., smartphone, tablet, laptop, PC, etc.). The preparation and mutual arrangement of text, visual content and audio track is implemented as follows:

1) Prepare a single coherent presentation text and enter it into the device 101;

2) Split the presentation text into fragments corresponding to the presentation frames, placing the marker of the text markup into frames between all neighboring frames 402;

3) Record (load) an audio track with the verbal voice content of the presentation;

4) Split the audio track with the verbal voice content of the presentation into frames;

5) Create visual imagery of frames for each frame of the presentation by:

-   creating and editing the graphical content of the frame;
-   recording a visual representation of all actions of creating and editing the visual content.

6) Arrange the presentation prototype by:

a) Preparing its own configuration file for each frame by:

-   determining the play time of the frame within the audio track;
-   making a uniform time scaling (shrink/expand) of the visual representation of creating and editing the graphic content of the frame visual imagery until the duration of the frame visual representation and the duration of its audio content match (see the sketch after this scenario);
-   determining frame visualization data (aspect ratio, resolution, color, transparency, FPS, etc.);
-   playing back the frame audio track and the time-scaled visual representation of the frame content simultaneously and recording the resulting mix of sound and animation into a configuration file of the frame;
-   playing back the configuration file of the frame on a screen and evaluating it: if the result is positive, saving the file; if the result is negative, editing the text, sound or frame figures and repeating the configuration of that frame.

b) Combine the configuration files of the frames, in the required order of the frames sequence, into a single audio-visual file of the presentation;

7) Review the configuration audio-visual file of the presentation for the evaluation of its level of completeness and correspondence to the purposes of the presentation.

According to the evaluation results:

a) Make the required correction of text, sound or visual content of the frames in a particular work session and repeatedly configure the presentation prototype;

b) Save the configuration file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.
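The per-frame configuration of step 6 can be sketched as follows; the Frame structure, the configuration dictionary and the visualization defaults are hypothetical illustrations of the described flow, not the patent's actual file format.

```python
from dataclasses import dataclass, field

@dataclass
class Frame:
    text: str
    audio_duration: float  # seconds, taken from the audio track markup
    drawing_events: list = field(default_factory=list)  # (timestamp, event) pairs

def configure_frame(frame):
    """Step 6a: scale the frame's drawing record to its audio duration and
    bundle it with frame visualization data (FPS, resolution, etc.)."""
    last = frame.drawing_events[-1][0] if frame.drawing_events else 0.0
    factor = frame.audio_duration / last if last else 0.0
    return {
        "scaled_events": [(t * factor, e) for t, e in frame.drawing_events],
        "audio_duration": frame.audio_duration,
        "visualization": {"fps": 30, "resolution": (1280, 720)},  # assumed values
    }

def configure_presentation(frames):
    # Step 6b: combine the per-frame configurations in the required frame
    # order; this stands in for writing the single audio-visual file.
    return [configure_frame(f) for f in frames]
```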

The method of creating prototypes of multimedia presentations in the scenario “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” includes determination of the frames order, their textual and visual content and the order of transition between the frames, and the preparation and mutual composition of text, visual content and audio track of the multimedia presentation into an audio-video file by using a personal computing device (e.g., smartphone, tablet, laptop, PC). The preparation and mutual arrangement of text, visual content and audio track is implemented as follows:

1) Prepare a single coherent presentation text and enter it into the device 101;

2) Split the presentation text into fragments corresponding to the presentation frames, placing a marker of the text markup into frames between all the neighboring frames 402;

3) Create visual content of the frames for each frame of the presentation by:

-   creating and editing the graphic content of the frame;
-   recording a visual representation of all actions of creating and editing the graphic content.

4) Record (load) an audio track with the verbal voice content of the presentation;

5) Split the audio track with the verbal voice content of the presentation into frames;

6) Arrange the presentation prototype, and:

a) Prepare its own configuration file for each frame by:

-   determining the time of a frame sound playback within the audio track;
-   making a uniform time scaling (shrink/expand) of the visual representation of the graphic content of the frame imagery until the duration of the frame visual representation and the duration of its sound match;
-   determining frame visualization data (i.e., aspect ratio, resolution, color, transparency, FPS, etc.);
-   playing back the frame audio track and the time-scaled visual representation of creating and editing the graphic contents of the frame and recording the resulting mix of sound and animation into a configuration file of the frame;
-   playing back the configuration file of the frame on a screen and evaluating it: if the result is positive, saving the file; if the result is negative, going back to editing the text, sound or frame graphics and repeating the configuration of that frame;

b) Combine the configuration files of the frames, in the required order of the frames sequence, into a single audio-visual file of the presentation.

7) Review the arranged audio-visual file of the presentation for the evaluation of its level of completeness and correspondence to the purposes of the presentation. According to the evaluation results:

a) Make the required correction of text, sound or visual content of the frames in a particular work session and repeatedly configure the presentation prototype;

b) Save the configured file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.

The method of creating prototypes of a multimedia presentation in the scenario “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics” includes determination of the frames' order, their textual and visual content and the order of transition between the frames, and the preparation and mutual composition of text, visual content and audio track of the multimedia presentation into an audio-video file by using a personal computing device (e.g., smartphone, tablet, laptop, PC). The preparation and mutual arrangement of text, visual content and audio track is implemented as follows:

1) Record (load) an audio track with the verbal voice content of the presentation;

2) Prepare a single coherent presentation text and enter it into the device;

3) Split the presentation text into fragments corresponding to the presentation frames, placing the marker of the text markup into frames between all the neighboring frames;

4) Split the audio track with the verbal voice content of the presentation into frames;

5) Create visual imagery of the frames for each frame of the presentation by:

-   creating and editing the graphic content of the frame;
-   recording a visual representation of all actions of creating and editing the graphic content.

6) Configure the presentation prototype and:

a) Prepare its own configuration file for each frame by:

-   determining the time of a frame playback within the audio track;
-   making a uniform time scaling (shrink/expand) of the visual representation of creating and editing the graphic figures of the frame visual imagery until the duration of the frame visual representation and the duration of its sound match;
-   determining frame visualization data (aspect ratio, resolution, color, transparency, FPS, etc.);
-   simultaneously playing back the frame audio track and the time-scaled visual representation of creating and editing the graphic content of the frame and recording the resulting mix of sound and animation into a configuration file of the frame;
-   playing back the configuration file of the frame on a screen and evaluating it: if the result is positive, saving the file; if the result is negative, going back to editing the text, sound or frame figures and repeating the configuration of the frame;

b) Combine the configuration files of the frames, in the required order of the frames sequence, into a single configuration audio-visual file of the presentation.

7) Review the configuration audio-visual file of the presentation for the evaluation of its level of completeness and correspondence to the purposes of the presentation. According to the results of the evaluation:

a) Make the required correction of text, sound or visual imagery of the frames in a particular work session and repeatedly configure the presentation prototype;

b) Save the configuration file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”) the audio track with the voice content of the presentation is split into frames as follows:

1) To display:

a) Content of the frames' text and audio (synchronized at that particular moment) is visually presented as segments (bars) corresponding to the frames of the presentation, so that the visual representation of the text, bars or bar edges for the frames for which the audio track has already been marked up into fragments (frames) is visually different from the representation of the text, bars or bar edges for the frames for which the audio track has not been marked up yet;

b) A mutual visual border of the text content of the presentation, corresponding to the split between frames with marked up and not marked up audio, is displayed so that the visual location of the border on the screen stays constant while the frame bars move (i.e., scroll) during the markup process.

2) To listen to the audio, and when synchronization of one frame ends and synchronization of another frame begins:

a) In case of a manual markup, to insert a marker of the audio track markup, for example, by pressing on the image of the text bar of the next frame, and to visually animate the setting of the marker, e.g., by changing the look of the bar edge;

b) In case of automatic speech recognition (the recognized words are automatically matched to the texts of the frames and the audio track is automatically marked up), to check the accuracy of the automatic audio markup and, if necessary, to correct the markup.
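A minimal sketch of the automatic markup path in case b), assuming the speech recognizer returns words with start and end timestamps (an assumed interface): a frame boundary is placed in the audio track where the recognized word stream crosses from one frame's text to the next.

```python
def markup_audio_by_frames(frame_texts, recognized_words):
    """frame_texts: the text of each frame, in presentation order.
    recognized_words: (word, start_sec, end_sec) tuples in spoken order.
    Returns one end-of-frame marker (in seconds) per frame."""
    markers, i = [], 0
    for text in frame_texts:
        i += len(text.split())  # words expected in this frame
        # marker at the end time of the frame's last recognized word
        markers.append(recognized_words[i - 1][2])
    return markers

frames = ["hello and welcome", "this is frame two"]
words = [("hello", 0.0, 0.4), ("and", 0.5, 0.6), ("welcome", 0.7, 1.2),
         ("this", 1.8, 2.0), ("is", 2.1, 2.2), ("frame", 2.3, 2.6),
         ("two", 2.7, 3.0)]
print(markup_audio_by_frames(frames, words))  # [1.2, 3.0]
```

A real matcher would tolerate recognition errors rather than rely on exact word counts, which is why the method also provides for checking and manually correcting the automatic markup.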

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”), when creating and editing the graphic content of a frame, the following operations or their combinations are performed:

-   load graphic objects and images from an external source;
-   record video/take a picture with a camera integrated into the user device and insert the pictures and videos into the visual content of the frames;
-   automatically detect drawn figures and/or lines and correct their geometry; if the suggested version is approved, delete the originally created figures and/or lines and replace them with the edited figures and/or lines, so that the duration of creating and editing the corrected figures equals the duration of creating and editing the originally created figures and/or lines, excluding the actual time for automatic correction and review of the suggested versions (see the sketch after this list);
-   attach indicating (connective) lines and captions to figures and automatically move the matching lines and text along with the figures when moving them during the course of editing;
-   set a method of transition between the visual imageries of two adjacent frames;
-   play back a fragment of the audio track of the current frame in order to check the creation of the visual imagery of the frame while drawing and editing graphic figures;
-   perform a test animation of an already created part of the visual imagery of a frame in order to check the creation of the visual imagery of the frame;
-   integrate (merge) several graphic figures and/or their elements into complex objects with the possibility of subsequently managing these objects (moving, rotating, skewing, rescaling, showing/hiding, etc.) as integrated images.
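A hedged sketch of the figure recognition rule referenced above: the approved, corrected figure is substituted into the drawing record on the original timestamps, so the replay duration of the frame is unchanged and the time spent on automatic correction and review never enters the record. The event format is the same illustrative assumption used earlier.

```python
def replace_recognized_figure(figure_events, corrected_stroke):
    """figure_events: (timestamp, event) pairs that drew the original
    figure. Returns the same timeline with the corrected figure's strokes
    substituted for the hand-drawn ones."""
    return [(t, corrected_stroke) for t, _ in figure_events]

hand_drawn = [(2.0, "rough_circle_stroke"), (3.5, "rough_circle_stroke")]
print(replace_recognized_figure(hand_drawn, "smooth_circle_stroke"))
# [(2.0, 'smooth_circle_stroke'), (3.5, 'smooth_circle_stroke')]
```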

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”), when creating the configuration file of a frame for synchronization of animation with audio, the time of the visual representation of creating and editing the graphic content of the frame visual imagery is determined in one of the following ways:

a) From the start marker of the frame audio record within the audio track to the end marker of the frame audio (including the time of pauses before the first word and after the last one);

b) From the start of the actual sound of the first word of the frame text to the end of the sound of the last word of the frame text (excluding the time of the pauses before the first word and after the last one).

In both cases a) and b), additional delays (pauses) can be inserted before the start of the sound of the first word of the first frame and after the end of the sound of the last word of the last frame.
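The two timing rules can be sketched as follows, again assuming word-level timestamps; the marker arguments and word format are illustrative assumptions.

```python
def frame_duration(words, marker_start, marker_end, include_pauses):
    """words: (word, start_sec, end_sec) tuples of the frame's speech.
    Rule a): marker to marker, pauses at both ends included.
    Rule b): first word onset to last word offset, pauses excluded."""
    if include_pauses:
        return marker_end - marker_start           # rule a)
    return words[-1][2] - words[0][1]              # rule b)

words = [("draw", 1.0, 1.4), ("a", 1.5, 1.6), ("circle", 1.7, 2.3)]
print(frame_duration(words, 0.5, 3.0, True))   # 2.5 (rule a)
print(frame_duration(words, 0.5, 3.0, False))  # 1.3 (rule b)
```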

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”), the correction of text, sound and visual imagery of the frames is made as follows:

1) When splitting an original frame that has visual imagery into two new frames:

-   split the text of the frame by the text markup marker into two fragments corresponding to the two new frames;
-   choose which one of the two newly created frames will keep the visual imagery of the original frame;
-   create visual imagery for the second newly created frame.

2) When integrating two adjacent original frames into one new frame:

a) If there is no visual content in the frames—delete the markers of the text markup and of the audio track markup of the frames;

b) If there is imagery content in only one of the integrated frames—delete the markers of the text markup and of the audio track markup of the frames and attach the visual imagery to the resulting integrated frame;

c) If there are visual imageries in both integrated frames (see the sketch after this list):

-   delete the markers of the text markup and of the audio track markup of the frames;
-   create the resulting audio track by connecting the two original audio tracks consecutively, keeping their original sequence;
-   create the resulting visual imagery by consecutive connection of the two original visual imageries, keeping their original sequence and the frame rate of the original frames.
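A sketch of merge case c): the audio tracks and the drawing records of the two adjacent frames are connected consecutively, keeping their original sequence. The dictionary fields are hypothetical stand-ins for a frame's text, audio and drawing record.

```python
def merge_frames(a, b):
    """Merge two adjacent frames that both carry visual imagery."""
    # the second drawing record is shifted to start where the first ends
    shift = a["drawing_record"][-1][0] if a["drawing_record"] else 0.0
    return {
        "text": a["text"] + " " + b["text"],
        "audio_duration": a["audio_duration"] + b["audio_duration"],
        "drawing_record": a["drawing_record"]
                          + [(t + shift, e) for t, e in b["drawing_record"]],
    }
```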

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”), the markup of the audio track of the presentation into frames can be corrected as follows:

-   determine the two adjacent frames and the location of the marker of the audio track markup to be corrected;
-   display an image of the audio track with a sound end marker;
-   play back the continuous, consecutive sound of both frames and simultaneously display a time line with a locator at the current position of the marker of the audio track markup;
-   move the locator of the marker of the audio track markup right or left on the time line after listening to the end of one frame and the beginning of the following one;
-   after moving, check the result by playing back the audio track from a time mark N seconds before the new position of the marker of the audio track markup, where N is from 0 to 4 seconds (see the sketch below).
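The final check in the list above can be sketched as follows: playback restarts N seconds (0 to 4) before the marker's new position, so the corrected boundary is heard in context. The function name is an illustrative assumption.

```python
def recheck_start(new_marker_sec, n_seconds=2.0):
    """Return the time mark from which to replay the audio track after
    moving a markup marker; N is constrained to 0..4 seconds."""
    assert 0.0 <= n_seconds <= 4.0
    return max(0.0, new_marker_sec - n_seconds)

print(recheck_start(10.5))      # 8.5
print(recheck_start(1.0, 4.0))  # 0.0
```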

Additionally, in all three of the above mentioned scenarios (“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”, “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”), a transition between the visual imageries of two adjacent frames when opening (creating) a new frame is implemented in the following ways or a combination thereof:

-   save the image of the previous frame without changes as a start image of the new frame;
-   erase the image of the previous frame completely;
-   erase the entire image of the previous frame by moving an eraser tool on the screen;
-   erase the image of the previous frame down to the background (i.e., a background image);
-   erase a part of the screen field with the eraser tool (horizontally);
-   erase chosen elements of the image of the previous frame (chosen by element);
-   restore the final image of the frame before the previous one (cancel all changes of the image made on the previous frame);
-   move (rotate) the visual field (virtual image sheet) of the previous frame, opening a clean space for creating graphics of the new frame and leaving in sight a small portion of the image of the previous frame;
-   minimize the resulting image of the previous frame into an icon, placing the icon on the screen field of the new frame (as a reminder of the previous frame);
-   minimize the resulting image of the previous frame and leave it on the screen field of the new frame (for drawing a new image around the previous one);
-   extend the resulting image of the previous frame (a part of the image possibly going out of the frame) and leave the part apparent within the frame edges on the screen field of the new frame (for drawing a new image inside the part of the previous one).

According to the exemplary embodiment, devices for creating multimedia presentation prototypes can be designed in the versions “Mini” and “Screen.” A device 101 for creating multimedia presentation prototypes in the “Mini” version is designed in the form of a portable personal computing device (e.g., smartphone, tablet, laptop), containing a touch screen display 102 provided with a sensor detecting the position of a finger touch, an on-screen or physical keyboard, a built-in or external microphone 105, a built-in or external speaker 106, and a data processing unit connected to the display, the keyboard, the microphone and the speaker and designed with an option to receive, process and output signals and data.

Additionally, the device is designed with the following options:

-   when the user touches the displayed text on the screen, process the positions of the finger touch as a command setting the locations at which the displayed text is split into text fragments 402, and form data structures in the device memory corresponding to the specified fragments (frames);
-   split the recorded audio track into audio slices and connect them with the text fragments (frames);
-   record the process of drawing by the user's finger touch on the display and connect it with the text fragments (frames);
-   proportionally scale (shrink/expand) the playback time of the drawing process made by the user's finger touch on the display;
-   display on the screen the recorded process of figure drawing corresponding to the matching text fragments (frames) on a time scale equal to the sound duration of the audio slices corresponding to these text fragments (frames), with simultaneous playback of these audio fragments.
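A minimal sketch of the in-memory structures these options imply, connecting each text fragment (frame) with its audio slice and its drawing record; all field names are illustrative assumptions rather than the device's actual memory layout.

```python
frames_in_memory = [
    {"text": "First frame text.",
     "audio_slice": (0.0, 4.2),   # start/end seconds within the audio track
     "drawing_record": [(0.0, "pen_down"), (3.1, "pen_up")]},
    {"text": "Second frame text.",
     "audio_slice": (4.2, 9.8),
     "drawing_record": []},       # no drawing recorded for this frame yet
]
```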

Additionally, the data processing unit is designed with the option to perform the following operations or a combination thereof:

-   automatically detect verbal text pronunciation;
-   automatically divide the recorded audio track into audio slices and connect these audio slices with the matching text fragments (frames);
-   automatically detect drawn graphic figures and choose their smoothed out and/or regular-shaped equivalents from the image library.

A device for creating multimedia presentation prototypes in the “Screen” version is designed in the form of a portable personal computing device (e.g., smartphone, tablet, laptop) containing:

-   a touch screen display provided with a sensor detecting the position of a finger touch, designed with an option of displaying on the screen the device mode of operations selection panel, so that when different zones of the panel are touched, the device switches to the corresponding modes of operation;
-   a device on-screen mode of operations selection panel designed in the form of a bar visually divided into field-buttons with symbolic representations of the device operation modes (icons), so that when these field-buttons are pressed, they expand lengthwise (unfold) and the sub-buttons available in these modes of operation of the device appear;
-   an on-screen or physical keyboard;
-   a built-in or external microphone;
-   a built-in or external speaker;
-   a data processing unit connected to the display, the keyboard, the microphone and the speaker and designed with an option to receive, process and output signals and data.

Additionally, the device mode of operations selection panel 201 is located in the upper section of the screen across its entire width and is provided with the field-buttons “Presentation management” 204, “Text editing” 205, “Sound editing” 206 and “Graphic editing” 207. A work field of the active operation mode 202 is displayed under the selection panel on the device screen. Thus, at launching (startup) of the device, if no multimedia presentation prototype is chosen for processing, the device switches to the “Presentation management” mode and the icons (final drawings) of the presentation prototypes available for review or editing, with captions 301, are displayed in the work field. If a prototype is chosen at launch, the device switches to the “Text editing” mode of the chosen prototype and the text of the prototype 401 is displayed in the work field.

Additionally, the field-button “Presentation management” 204, after it is opened (pressed), contains the sub-buttons “Create presentation” 303, “Open presentation for editing” 304, “Launch demonstration of presentation” 305, “Delete presentation” 306 and “Save presentation as” 307. Thus, the icons (final drawings) of the presentation prototypes with captions 301 are displayed in the work field.

Additionally, the field-button “Text editing”, after it is opened (pressed), contains the sub-buttons “Text import” 403, “Text verbal dictation with recognition” 404, “Text input from on-screen keyboard” 405 and “Insert the marker of the frame text edge” 406. Thus, the text of the prototype 401 is displayed in the work field.

Additionally, the field-button “Sound editing”, after it is opened (pressed), contains the sub-buttons “Import sound from file and markup sound into words” 501 and “Record sound from microphone and markup sound into words” 502. Thus, the text of the prototype is visually split into segments (bars) corresponding to the fragments (frames) 503 of the presentation and is displayed in the work field, so that the text part of the presentation for which an audio track is already marked up into fragments (frames) visually differs from the part for which the markup is not done yet 504.

Additionally, the field-button “Graphic editing”, after it is opened (pressed), contains the sub-buttons “Turn on/turn off figures automatic recognition mode” 601, “Turn on/turn off text narrator” 602 and “Move forward/back through frames” 603. Thus, the image sheet (canvas), tools and palette 605 for creating and editing graphic images of the current fragment (frame) 604 are displayed in the work field.

Additionally, if at the opening (pressing) of the “Sound editing” button there are graphic images of the presentation fragments (frames), the icons of the graphic images corresponding to these fragments (frames) are displayed in the segments (bars) of the work field along with the text. Additionally, the segments (bars) of the work field are designed with an option of being pressed by a finger, so that upon being pressed, the pronounced or recognized text is attached to the fragment (frame) of the text displayed in the current segment (bar) of the work field.

Additionally, the text displayed in the segments (bars) of the work field is designed with an option of tactile selection and of moving the text from the segment (bar) of one fragment (frame) to another. Additionally, the sub-button “Move forward/back through frames”, for frames starting from the second, is designed in the form of two buttons, “Scroll forward/back” and “Select the look of transition between frames”, so that when the button “Select the look of transition between frames” is pressed by a finger, a list of selections of the transition mode is displayed, which is designed as a set of the following buttons:

“Save frame image as starting image of the next frame,” “Erase frame image entirely and instantly,” “Erase frame image entirely with delay,” “Erase frame image entirely with eraser tool,” “Erase frame image up to background,” “Erase part of frame image with eraser tool,” “Erase selected elements of frame image,” “Restore final image of the frame before the previous,” “Move (rotate) the image opening blank space to create figures of a new frame and leaving in sight a small part of the previous frame image,” “Minimize frame image placing the icon on the field of a new frame,” “Shrink frame image and leave it on the field of a new frame,” “Expand frame image and leave it on the field of a new frame.”

Methods of using the device for creation of multimedia presentation prototypes can be implemented in the scenarios:

“Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics”,

“Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” and

“Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics”.

A method of using the device 101 for creation of multimedia presentation prototypes uses a personal computing device with a keyboard, a microphone 105, a speaker 106 and a touch screen 102. Control and mode selection buttons 201 and a work field for data processing 202 are displayed on the screen of the device. The method of using the device includes entering text, sound and graphic symbols, and their processing and composition into an audiovisual file of the multimedia presentation.

The method of using the device for creation of multimedia presentation prototypes in the scenario “Text/Text_markup_into_frames/Sound/Sound_markup_into_frames/Graphics” is implemented as follows:

1) After launching, switch the device to the presentation management mode;

2) Press the Create presentation button 303 and switch the device to the creation of a presentation mode;

3) Name the presentation in the creation of a presentation mode, press the text editing button and switch the device to the creation of a text part of the presentation mode;

4) In the creation of the text part of the presentation mode, press the button corresponding to the way the presentation text is input into the device, as follows:

a) At importing text from an external file—the import text button 403;

b) At text input from the on-screen keyboard—the show on-screen keyboard button 405;

c) At verbal text input with automatic recognition—the verbal dictation with recognition button 404;

5) Enter text into the device; the entered text is placed in the work field 202 of the device screen;

6) Insert the markers of fragment (frame) text edges 402: press the set the marker of a frame text edge button 406 and after that press:

a) At importing text from an external file or entering text from the on-screen keyboard—on the text displayed in the work field inside the markers of the frames' text edges;

b) At verbal text entry with automatic word recognition and placing of markers of word edges—on the markers of the word edges inside the markers of the frames' text edges;

7) Press the sound editing button 206 and switch the device to the sound editing mode of the presentation;

8) In the sound editing mode, select a source of the sound, so that depending on the source the following buttons are pressed:

a) At importing sound from an external file—the import sound from an external file button 501, the external file location button and the launch/stop audio playback button;

b) At recording sound from a microphone—the record sound from microphone button 502 and the launch/stop audio playback button.

9) At the launch of audio playback, display on the screen:

a) Text of the sound frames (synchronized at that particular moment) visually presented as segments (bars) corresponding to the frames of the presentation, so that the visual representation of the text, bars or bar edges of the frames for which the audio track has already been marked up into fragments (frames) 503 is visually different from the representation of the text, bars or bar edges of the frames for which the audio track has not been marked up yet 504;

b) A mutual visual border of the text content of the presentation corresponding to the split between frames with marked up and not marked up sound 505, so that the visual location of the border on the screen stays constant while the frame bars move (scroll) during the markup process.

10) At audio playback, listen to the sound, and when synchronization of one frame ends and synchronization of another frame begins:

a) At manual markup—press on the image of the text bar of the next frame and insert the displayed marker of the audio track markup; additionally, visually animate the setting of the marker, e.g., by changing the look of the bar edge;

b) At automatic speech recognition—the recognized words are automatically matched to the texts of the frames and the audio track is automatically marked up—check the accuracy of the automatic sound markup. If an error of recognition and markup is detected, press the launch/stop audio playback button and correct the markup.

11) Press the graphic editing button 207 and switch the device to the graphic editing of the presentation frames mode; thus, the image sheet (canvas), tools and palette 605 for creating and editing graphic images of the frame 604 are displayed in the work field, starting from the first frame.

12) In the graphic editing mode, press the buttons corresponding to the selected tools of the graphics palette and create and edit graphic figures of the frame by touching the device screen in the area of the sheet (canvas); thus, record a visual representation of all actions for creating and editing figures.

13) In the graphic editing mode the following buttons can be pressed:

a) Turn on/turn off figures automatic recognition mode 601;

b) Turn on/turn off text and/or audio narrator 602;

c) Move forward/back through frames 603;

d) Select the look of transition between frames.

14) Press the demonstration of the presentation button 305 and switch the device to the configuration and demonstration of the presentation mode, thus:

a) Determine the time of frame playback within the audio track for each frame;

b) Make a uniform time scaling (shrink/expand) of the visual representation of creating and editing graphic figures of the visual imagery of each frame until the duration of the frame visual representation and the duration of its sound match;

c) Determine frame visualization data (aspect ratio, resolution, color, transparency, FPS, etc.);

d) Launch, via the speaker 106, a playback of the audio track of the presentation and, on the screen, the time-scaled visual representation of creating and editing graphic figures of the frames in their sequence, and record the resulting mix of the sound and the animation into a configured audiovisual file of the presentation.

15) Review the demonstration of the presentation and evaluate its level of completeness and correspondence to the purposes of the presentation. According to the results of the evaluation:

a) Press the buttons of text, sound or graphic editing, move over to text, sound or graphic correction in this particular work session and repeat the configuration of the presentation;

b) Press the “Save work version of file” button 307 and save the configuration file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Press the “Save as video file” button and convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.

The method of using the device for creation of multimedia presentation prototypes in the scenario “Text/Text_markup_into_frames/Graphics/Sound/Sound_markup_into_frames” is implemented as follows:

1) After launching, switch the device to the presentation management mode;

2) Press the Create presentation button and switch the device to the creation of the presentation mode;

3) Name the presentation in the creation of a presentation mode, press the text editing button and switch the device to the creation of a text part of the presentation mode;

4) In the creation of the text part of the presentation mode, press the button corresponding to the way the presentation text is input into the device, as follows:

a) At importing text from an external file—the import text button 403;

b) At text input from the on-screen keyboard—the show on-screen keyboard button 405;

c) At verbal text input with automatic recognition—the verbal dictation with recognition button 404;

5) Enter text into the device, so that the entered text is placed in the work field of the device screen;

6) Insert the markers of fragment (frame) text edges: press the set the marker of the frame text edge button 406 and after that press:

a) At importing text from an external file or entering text from the on-screen keyboard—on the text displayed in the work field inside the markers of the frames' text edges;

b) At verbal text entry with automatic word recognition and placing of markers of word edges—on the markers of the word edges inside the markers of the frames' text edges.

7) Press the graphic editing button 207 and switch the device to the graphic editing of the presentation frames mode. Thus, the image sheet (canvas), tools and palette for creating and editing graphic images are displayed in the work field, starting from the first frame.

8) In the graphic editing mode, press the buttons corresponding to the selected tools of the graphics palette and create and edit graphic figures of the frame 604 by touching the device screen in the area of the sheet (canvas). Thus, record a visual representation of all actions for creating and editing figures.

9) In the graphic editing mode the following buttons can be pressed:

a) Turn on/off figures automatic recognition mode 601;

b) Turn on/off text and/or audio narrator 602;

c) Move forward/back through the frames 603;

d) Select the look of the transition between the frames;

10) Press the sound editing button 206 and switch the device to the sound editing mode of the presentation;

11) In the sound editing mode, select a source of the sound and, depending on the source, press the following buttons:

a) At importing sound from an external file—the sound import from an external file button 501, the external file location button and the launch/stop audio playback button;

b) At recording sound from a microphone—the record sound from microphone button 502 and the launch/stop audio playback button;

12) At the launch of audio playback, display on the screen:

a) Text of the sound frames (synchronized at that particular moment) visually presented as segments (bars) corresponding to the frames of the presentation, so that the visual representation of the text, bars or bar edges for the frames for which the audio track has already been marked up into fragments (frames) 503 is visually different from the representation of the text, bars or bar edges for the frames for which the audio track has not been marked up yet;

b) A mutual visual border of the text content of the presentation 505 corresponding to the split between sound frames marked up and not marked up, so that the visual location of the border on the screen stays constant while the frame bars move (scroll) in the process of marking up.

13) At audio playback, listen to the sound, and when synchronization of one frame ends and synchronization of another frame begins:

a) At manual markup—press on the image of the text bar of the next frame and insert the displayed marker of the audio track markup. Thus, additionally visually animate the setting of the marker, e.g., by changing the look of the bar edge;

b) At automatic speech recognition—the recognized words are automatically matched to the texts of the frames and the audio track is automatically marked up—check the accuracy of the automatic sound markup, and if an error of recognition and markup is detected, press the launch/stop audio playback button and correct the markup;

14) Press the demonstration of the presentation button 305 and switch the device to the configuration and demonstration of the presentation mode, thus:

a) Determine the time of a frame playback within the audio track for each frame;

b) Make a uniform time scaling (shrink/expand) of the visual representation of creating and editing graphic figures of the visual imagery of each frame until the duration of the frame visual representation and the duration of its sound match;

c) Determine frame visualization data (aspect ratio, resolution, color, transparency, FPS, etc.);

d) Launch, via the speaker 106, a playback of the audio track of the presentation, display on the screen the time-scaled visual representation of creating and editing graphic figures of the frames in their sequence, and record the resulting mix of the sound and the animation into a configured audiovisual file of the presentation.

15) Review the demonstration of the presentation and evaluate its level of completeness and correspondence to the purposes of the presentation. According to the results of the evaluation:

a) Press the buttons of text, sound or graphic editing, move over to text, sound or graphic correction in this particular work session and repeat the configuration of the presentation;

b) Press the “Save work version of file” button and save the configuration file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Press the “Save as video file” button 307 and convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.

The method of using the device for creating multimedia presentation prototypes in the scenario “Sound/Text/Text_markup_into_frames/Sound_markup_into_frames/Graphics” is implemented as follows:

1) After launching, switch the device to the presentation management mode;

2) Press the Create presentation button 303 and switch the device to the creation of the presentation mode;

3) Name the presentation in the creation of the presentation mode;

4) Press the sound editing button 206 and switch the device to the sound editing mode of the presentation;

5) In the sound editing mode, select a source of the sound and, depending on the source, press the following buttons:

a) At importing sound from an external file—the import sound from an external file button, the external file location button and the launch/stop audio playback button;

b) At recording sound from a microphone—the record sound from microphone button and the launch/stop audio playback button;

6) Press the text editing button 205 and switch the device to the creation of a text part of the presentation mode;

7) In the creation of a text part of the presentation mode, press the button corresponding to the way the presentation text is input into the device, as follows:

a) At importing text from an external file—the import text button 403;

b) At text input from the on-screen keyboard—the show on-screen keyboard button 405;

c) At verbal text input with automatic recognition—the verbal dictation with recognition button 404.

8) Enter text into the device—the entered text is placed in the work field of the device screen;

9) Insert the markers of fragment (frame) text edges 402: press the set the marker of frame text edge button 406 and after that press:

a) At importing text from an external file or entering text from the on-screen keyboard—on the text displayed in the work field inside the markers of the frames' text edges;

b) At verbal text entry with automatic word recognition and placing of markers of word edges—on the markers of the word edges inside the markers of the frames' text edges.

10) Press the sound editing button 206 and switch the device to the sound editing mode of the presentation, thus displaying on the screen:

a) Text of the sound frames (synchronized at that particular moment) visually presented as segments (bars) corresponding to the frames of the presentation, so that the visual representation of the text, bars or bar edges of the frames for which the audio track has already been marked up into fragments (frames) 503 is visually different from the representation of the text, bars or bar edges of the frames for which the audio track has not been marked up yet 504;

b) A mutual visual border of the text content of the presentation 505 corresponding to the split between frames with marked up and not marked up sound, so that the visual location of the border on the screen stays constant while the frame bars move (scroll) in the process of marking up.

11) At audio playback—listen to the sound, and when synchronization of one frame ends and synchronization of another frame begins:

a) At manual markup, press on the image of the text bar of the next frame and insert the displayed marker of the audio track markup, thereby additionally visually animating the setting of the marker, e.g., by changing the look of the bar edge;

b) At automatic speech recognition, the recognized words are automatically matched to the texts of the frames and the audio track is automatically marked up—check the accuracy of the automatic sound markup, and if an error of recognition and markup is detected, press the launch/stop audio playback button and correct the markup;

12) Press the graphic editing button 207 and switch the device to the graphic editing of the presentation frames mode, thus the image sheet (canvas), tools and palette 605 for creating and editing graphic images of the frame 604 are displayed in the work field, starting from the first frame;

13) In the graphic editing mode, press the buttons corresponding to the selected tools of the graphics palette and create and edit graphic figures of the frame by touching the device screen in the area of the sheet (canvas), thus recording a visual representation of all actions for creating and editing figures.

14) In the graphic editing mode the following buttons can be pressed:

a) Turn on/off figures automatic recognition mode 601;

b) Turn on/off text and/or audio narrator 602;

c) Move forward/back through frames 603;

d) Select the look of the transition between the frames.

15) Press the demonstration of the presentation button 305 and switch the device to the configuration and demonstration of the presentation mode, thus:

a) Determine the duration of frame playback within the audio track for each frame;

b) Make a uniform time scaling (shrink/expand) of the visual representation of creating and editing graphic figures of the visual imagery of each frame until the duration of the frame visual representation and the duration of its sound match;

c) Determine frame visualization data (aspect ratio, resolution, color, transparency, FPS, etc.);

d) Launch, via the speaker 106, a playback of the audio track of the presentation, launch on the screen the time-scaled visual representation of creating and editing graphic figures of the frames in their sequence, and record the resulting mix of the sound and the animation into a configuration audiovisual file of the presentation.

16) Review the demonstration of the presentation and evaluate its level of completeness and correspondence to the purposes of the presentation. According to the results of the evaluation:

a) Press the buttons of text, sound or graphic editing, move over to text, sound or graphic correction in this particular work session and repeat the configuration of the presentation;

b) Press the “Save work version of file” button and save the arrangement file of the presentation in a format allowing for future editing of text, sound and visual imagery of the frames in the next work session;

c) Press the “Save as video file” button and convert the configuration file of the presentation into a video file without the possibility of future editing of text, sound or images of the frames.

Those skilled in the art will appreciate that the claimed invention advantageously reduces the high manpower effort and the complexity of creating multimedia presentations for personal and professional use.

With reference to FIG. 7, an exemplary system for implementing theinvention includes a general purpose computing device in the form of acomputer 101 or the like, including a processing unit 21, a systemmemory 22, and a system bus 23 that couples various system componentsincluding the system memory to the processing unit 21.

The system bus 23 may be any of several types of bus structuresincluding a memory bus or memory controller, a peripheral bus, and alocal bus using any of a variety of bus architectures. The system memoryincludes a read-only memory (ROM) 24 and random access memory (RAM) 25.A basic input/output system 26 (BIOS), containing the basic routinesthat help to transfer information between the elements within thepersonal computer 101, such as during start-up, is stored in ROM 24.

The computer 101 may further include a hard disk drive 27 for reading from and writing to a hard disk, not shown herein, a magnetic disk drive 28 for reading from or writing to a removable magnetic disk 29, and an optical disk drive 30 for reading from or writing to a removable optical disk 31 such as a CD-ROM, DVD-ROM or other optical media. The hard disk drive 27, magnetic disk drive 28, and optical disk drive 30 are connected to the system bus 23 by a hard disk drive interface 32, a magnetic disk drive interface 33, and an optical drive interface 34, respectively.

The drives and their associated computer-readable media provide non-volatile storage of computer readable instructions, data structures, program modules and other data for the personal computer 101. Although the exemplary environment described herein employs a hard disk, a removable magnetic disk 29 and a removable optical disk 31, it should be appreciated by those skilled in the art that other types of computer readable media that can store data that is accessible by a computer, such as magnetic cassettes, flash memory cards, digital video disks, Bernoulli cartridges, random access memories (RAMs), read-only memories (ROMs) and the like may also be used in the exemplary operating environment.

A number of program modules may be stored on the hard disk, magnetic disk 29, optical disk 31, ROM 24 or RAM 25, including an operating system 35 (e.g., Microsoft Windows™ 2000). The computer 101 includes a file system 36 associated with or included within the operating system 35, such as the Windows NT™ File System (NTFS), one or more application programs 37, other program modules 38 and program data 39. A user may enter commands and information into the personal computer 101 through input devices such as a keyboard 40 and pointing device 42.

Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, scanner or the like. These and other input devices are often connected to the processing unit 21 through a serial port interface 46 that is coupled to the system bus, and they may also be connected by other interfaces, such as a parallel port, game port or universal serial bus (USB). A monitor 47 or other type of display device is also connected to the system bus 23 via an interface, such as a video adapter 48. In addition to the monitor 47, personal computers typically include other peripheral output devices (not shown), such as speakers and printers.

The personal computer 101 may operate in a networked environment using logical connections to one or more remote computers 49. The remote computer (or computers) 49 may be another personal computer, a server, a router, a network PC, a peer device or other common network node, and it typically includes some or all of the elements described above relative to the personal computer 101, although here only a memory storage device 50 is illustrated. The logical connections include a local area network (LAN) 51 and a wide area network (WAN) 52. Such networking environments are common in offices, enterprise-wide computer networks, Intranets and the Internet.

When used in a LAN networking environment, the personal computer 101 is connected to the local network 51 through a network interface or adapter 53. When used in a WAN networking environment, the personal computer 101 typically includes a modem 54 or other means for establishing communications over the wide area network 52, such as the Internet.

The modem 54, which may be internal or external, is connected to the system bus 23 via the serial port interface 46. In a networked environment, the program modules depicted relative to the personal computer 101, or portions thereof, may be stored in the remote memory storage device. It will be appreciated that the network connections shown are merely exemplary and other means of establishing a communications link between the computers may be used.

Having thus described a preferred embodiment, it should be apparent to those skilled in the art that certain advantages of the described method have been achieved.

It should also be appreciated that various modifications, adaptations, and alternative embodiments thereof may be made within the scope and spirit of the present invention. The invention is further defined by the following claims.

What is claimed is:
1. A computer-implemented method for creating an animation, the method comprising: using a processor of a touchscreen device, defining animation sequences and an order of transition between the animation sequences; preparing a composition of a visual content and an audio track of the animation by performing (a) on the touchscreen device, drawing dynamic elements corresponding to the animation sequences, wherein the dynamic elements are added to a static background image of each animation sequence, by recording a process of the drawing that uses a finger or a stylus on the touchscreen device; (b) entering a text to be narrated as the audio track into the touchscreen device; (c) specifying boundaries between fragments of the text by using a stylus or a finger to manually indicate the boundaries, wherein the fragments of the text correspond to neighboring animation sequences, and placing markers of a text markup at locations of the boundaries; (d) recording the audio track, using a microphone of the touchscreen device, by narrating the text so that pauses in the audio track correspond to the markers and represent transitions between the neighboring animation sequences; (e) splitting the audio track into portions associated with the animation sequences during the narration; (f) creating a configuration file for each animation sequence by determining a playback duration of the corresponding animation sequence and by making a uniform time scaling of the visual representation of the animation, so that a duration of the visual representation is adjusted to match a duration of the audio track recorded through the microphone; (g) setting an aspect ratio, resolution, color, transparency and FPS of the animation; simultaneously playing back, on the touchscreen device, the audio track and a time scaled visual representation of the dynamic elements as they were being drawn, and recording a resulting mix of the audio track and the dynamic elements into a configuration file of each animation sequence; playing back the configuration file of the animation sequence on the touchscreen device; combining the configuration files of the animation sequences in the order of transition of the animation sequences into a configuration multimedia file of the animation; and saving the configuration multimedia file of the animation.
2. The method of claim 1, further comprising editing the text, the audio track and the dynamic elements and re-configuring the animation.
3. The method of claim 1, wherein the text is entered into the touchscreen device by any of: loading prepared text from an external source; entering the text using an on-screen keyboard; and inserting the text verbally via a microphone with speech-to-text recognition.
4. The method of claim 1, further comprising marking up the audio track, including the steps of: displaying the text and audio frames synchronized at a particular moment in time visually presented as segments corresponding to the animation sequences, so that a visual representation of the text, bars or edges of the animation sequences for which the audio track has already been marked up is visually different from a presentation of the text, bars or edges of the animation sequences for which the audio track has not yet been marked up; and displaying a visual border of the text corresponding to splitting animation sequences into the animation sequences with marked up and not marked up audio content, so that a visual location of the border on a screen stays constant as a result of scrolling of the frame bars in a process of marking up; inserting a marker of an audio track markup where synchronization of one animation sequence ends and synchronization of another animation sequence begins in case of a manual markup; and automatically matching recognized words to texts of the animation sequences and automatically marking up the audio track in case of an automatic speech recognition.
5. The method of claim 1, further comprising creating and editing dynamic elements of an animation sequence by: automatically detecting drawn dynamic elements and correcting their geometry, wherein a duration of drawing of the dynamic elements is equal to a duration of the drawing of the original dynamic elements, excluding an actual time for automatic correction and reviewing suggested versions; attaching indicating lines and captions to the dynamic elements and automatically moving the matching lines and text along with the dynamic elements when moving them during editing; setting a transition mode between the dynamic elements of two adjacent animation sequences; playing back a fragment of the audio track of a current animation sequence in order to check creation of the dynamic elements of the animation sequence; and testing an animation of an already created part of the visual imagery of the animation sequence in order to check the creation of the dynamic elements of the animation sequence.
6. The method of claim 1, further comprising creating a configuration file of the animation sequence for synchronization of animation with audio, wherein a time of visual representation of creating and editing dynamic elements is determined: from a start marker of the audio track to the end marker of the audio track; and from a start of an actual sound of a first word of the text of the animation sequence to an end of a sound of a last word of the text of the animation sequence, wherein additional pauses are inserted before the start of the sound of the first word of the first animation sequence and after the ending of the sounding of the last word of the last animation sequence.
7. The method of claim 1, further comprising correction of the animation sequences by: manually indicating a boundary of two text fragments for two new neighboring animation sequences, and placing markers of a text markup at a location of the boundary; determining which one of the two newly created animation sequences has the dynamic elements of an original animation sequence; creating dynamic elements for the second animation sequence; deleting markers of the text markup and the audio track of the animation sequences, if there are no dynamic elements found in the animation sequences; deleting the marker of the text markup and the audio track of the animation sequences and attaching the dynamic elements to the resulting integrated animation sequence, if the dynamic elements are present only in one of the integrated animation sequences; and if there are no dynamic elements present in both integrated animation sequences: deleting the markers of the text markup and the audio track of the animation sequences; creating the resulting audio track by consecutive connection of two original audio tracks keeping their original sequence; and creating resulting dynamic elements by consecutive connection of two original dynamic elements keeping their original sequence and a frame rate of the original animation sequences.
8. The method of claim 1, further comprising correcting markup of the audio track by: determining two adjacent animation sequences and a location of a marker of the audio track markup to be corrected; displaying an image of the audio track with a sound end marker; playing back continuous consistent sounds of both animation sequences and simultaneously displaying a time line with a locator of a current position of the marker of the audio track markup; moving the locator of the marker of the audio track markup on the time line right or left after listening to an end of one animation sequence and a beginning of the following one; and checking a result after the moving by playing back audio of the track from a time mark N seconds before a new position of the marker of the audio track markup, where N is from 0 to 4 seconds.
9. The method of claim 1, further comprising executing a transition between two adjacent animation sequences when creating a new frame by: saving an image of the previous animation sequence without changes as a start image of a new animation sequence; erasing the image of the previous animation sequence completely; erasing the entire image of the previous animation sequence by moving an eraser tool on a screen; erasing the image of the previous animation sequence up to a background image; erasing a part of a screen field by the eraser tool horizontally; erasing chosen elements of the image of the previous animation sequence; restoring a final image of the animation sequence before the previous one; rotating a visual field of the previous animation sequence; opening a clean space for creating dynamic elements of the new animation sequence and leaving in sight a small part of the image of the previous frame; minimizing into an icon the resulting image of the previous animation sequence and placing the icon on the screen field of the new animation sequence; minimizing the resulting image of the previous animation sequence and leaving it on the screen field of the new animation sequence; and extending the resulting image of the previous animation sequence and leaving the part visible within the frame edges on the screen field of the new animation sequence.
10. A system for creating an animation, the system comprising: a touchscreen device having a processor, the processor configured to define animation sequences and an order of transition between the animation sequences; the processor configured to prepare a composition of a visual content and an audio track of the animation and to combine them into an animation file, by performing (a) on the touchscreen device, drawing dynamic elements corresponding to the animation sequences, wherein the dynamic elements are added to a static background image of each animation sequence, and recording a visual representation of a process of the drawing; (b) preparing a text to be narrated as the audio track and entering the text into the touchscreen device; (c) specifying boundaries between fragments of the text by using a stylus or a finger to manually indicate the boundaries, wherein the fragments of the text correspond to neighboring animation sequences, and placing markers of a text markup at locations of the boundaries; (d) recording the audio track, using a microphone of the touchscreen device, by narrating the text so that pauses in the audio track correspond to the markers and represent transitions between the neighboring animation sequences; (e) splitting the audio track into portions associated with the animation sequences during the narration; (f) creating a configuration file for each animation sequence by determining a playback duration of the animation sequence and by making a uniform time scaling of the visual representation of the animation, so that a duration of the visual representation is adjusted to match a duration of the audio track recorded through the microphone; (g) setting an aspect ratio, resolution, color, transparency and FPS of the animation; simultaneously playing back, on the touchscreen device, the audio track and a time scaled visual representation of the dynamic elements as they were being drawn, and recording a resulting mix of the audio track and the dynamic elements into a configuration file of each animation sequence; playing back the configuration file of the animation sequence on the touchscreen device; combining the configuration files of the animation sequences in the order of transition of the animation sequences into a configuration multimedia file of the animation; and saving the configuration multimedia file of the animation.
11. The system of claim 10, wherein the processor is configured to: automatically detect verbal text pronunciation; automatically divide the recorded audio track into audio fragments and connect these audio fragments with the matching text sequences; and automatically detect drawn graphic figures and select their smoothed out and regular-shaped equivalents from an image library.
12. The system of claim 10, wherein upon detecting a touch on different areas of the touchscreen device, an on-screen mode of an operation selection panel is presented in a form of a bar divided into field-buttons with icons of operation modes, wherein when the field-buttons are pressed on, they expand lengthwise and sub-buttons available in these modes of operations of the touchscreen device appear; and wherein: a mode of the operations selection panel is located in an upper section of the screen across the entire width and is provided with the field-buttons “Presentation management”, “Text editing”, “Sound editing”, and “Graphic editing”; a work field of an active operation mode is displayed under the selection panel on the touchscreen device; at startup of the touchscreen device, if a processed prototype is not selected, the touchscreen device switches to the “Presentation management” mode and the icons available for review or editing of prototypes with captions are displayed in a work field; and if the prototype is selected at launch, the touchscreen device switches to the “Text editing” mode of the selected prototype, thus the text of the prototype is displayed in the work field.
13. The system of claim 10, wherein the field-button “Presentation management” contains sub-buttons “Create presentation”, “Open presentation for editing”, “Launch demonstration of presentation”, “Delete presentation”, “Save presentation as”, and wherein the icons of the presentation prototypes with captions are displayed in the work field.
14. The system of claim 10, wherein the field-button “Text editing” contains sub-buttons “Text import”, “Text verbal dictation with recognition”, “Text input from on-screen keyboard”, “Insert the marker of the frame text edge.”
15. The system of claim 10, wherein the field-button “Sound editing” contains sub-buttons “Import sound from file and markup sound into words”, “Record sound from microphone and markup sound into words”, wherein the text of the prototype is split into segments corresponding to fragments of the animation and is displayed in the work field, so that a text part of the animation for which an audio track is already marked up into fragments is different from a part for which markup is not done yet.
16. The system of claim 10, wherein the field-button “Graphic editing” contains sub-buttons “Turn on/turn off figures automatic recognition mode”, “Turn on/turn off text narrator”, “Move forward/back through frames”, wherein an image sheet, tools and a palette for creating and editing graphic images of a current frame are displayed in the work field.
17. The system of claim 10, wherein upon pressing on the “Sound editing” button, the icons of graphic images corresponding to the frames are displayed in the segments of a work field along with the text.
18. The system of claim 17, wherein when the segments of the work field are pressed on, a pronounceable or a recognizable text is attached to the frame text displayed in a current segment of the work field.
19. The system of claim 17, wherein a sub-button “Move forward/back through frames” for the frame starting from the second is implemented in a form of two buttons “Scroll forward/back” and “Select the look of transition between frames”, wherein when the button “Select the look of transition between frames” is pressed, a list of the selection of the transition mode is displayed as a set of the buttons: “Save frame image as starting image of the next frame”, “Erase frame image entirely and instantly”, “Erase frame image entirely with delay”, “Erase frame image entirely with eraser tool”, “Erase frame image up to background”, “Erase part of frame image with eraser tool”, “Erase selected elements of frame image”, “Restore final image of the frame before the previous”, “Move (rotate) the image opening blank space to create figures of a new frame and leaving in sight a small part of the previous frame image”, “Minimize frame image placing the icon on the field of a new frame”, “Shrink frame image and leave it on the field of a new frame”, “Expand frame image and leave it on the field of a new frame”.
20. A computer program product comprising a computer-readable non-transitory medium containing computer code for creating an animation file by performing the steps of: (a) defining animation sequences of the animation file on a touchscreen device, each animation sequence including a static background image; (b) drawing dynamic elements corresponding to the animation sequences, wherein the dynamic elements are added to the static background image of each animation sequence, and recording a process of the drawing that uses a finger or a stylus on the touchscreen device; (c) entering a text to be narrated into the touchscreen device; (d) specifying boundaries between fragments of the text by using a stylus or a finger to manually indicate the boundaries, wherein the fragments of the text correspond to neighboring animation sequences, and placing markers of a text markup at locations of the boundaries; (e) recording an audio track, using a microphone of the touchscreen device, by narrating the text so that pauses in the audio track correspond to the markers and represent transitions between the neighboring animation sequences; (f) splitting the audio track into portions associated with the corresponding animation sequences; (g) creating a configuration file for each animation sequence by determining a playback duration of the animation sequence and by making a uniform time scaling of the visual representation of each animation sequence so that a duration of the visual representation is adjusted to match a duration of the audio track; (h) setting a transparency and FPS of the animation sequences; (i) simultaneously playing back, on the touchscreen device, the audio track and a time scaled visual representation of the dynamic elements as they were being drawn, and recording a resulting mix of the audio track and the dynamic elements into a configuration file of each animation sequence; (j) playing back the configuration files of the animation sequences on the touchscreen device; (k) combining the configuration files of the animation sequences into a configuration multimedia file; and (l) saving the configuration multimedia file.